AITopics | dynamic team composition

25b040c97a75021e57100648a20b1e10-Supplemental-Conference.pdf

Neural Information Processing Systems | Apr-25-2026, 03:28:45 GMT

agent, artificial intelligence, machine learning, (17 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

25b040c97a75021e57100648a20b1e10-Paper-Conference.pdf

Neural Information Processing Systems | Apr-25-2026, 03:28:42 GMT

Mutual-Information Regularized Multi-Agent Policy Iteration

Neural Information Processing Systems | Apr-24-2026, 11:28:45 GMT

Despite the success of cooperative multi-agent reinforcement learning algorithms, most of them focus on a single team composition, which prevents them from being used in more realistic scenarios where dynamic team composition is possible. While some studies attempt to solve this problem via multi-task learning over a fixed set of team compositions, there is still a risk of overfitting to the training set, which may lead to catastrophic performance when facing dramatically varying team compositions during execution. To address this problem, we propose to use mutual information (MI) as an augmented reward to prevent individual policies from relying too much on team-related information and to encourage agents to learn policies that are robust across different team compositions. Optimizing this MI-augmented objective in an off-policy manner can be intractable because the marginal distribution is dynamic. To alleviate this problem, we first propose a multi-agent policy iteration algorithm with a fixed marginal distribution and prove its convergence and optimality. Then, as the practical implementation, we propose to employ the Blahut-Arimoto algorithm and an imaginary team composition distribution for optimization with an approximate marginal distribution. Empirically, our method demonstrates strong zero-shot generalization to dynamic team compositions in complex cooperative tasks.
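
The alternation between a fixed-marginal policy update and a marginal update mentioned in the abstract can be illustrated with the classic Blahut-Arimoto iteration. The following is only a minimal tabular sketch, not the paper's method: the function names, the table `q` of action values per team composition, the composition distribution `p_c`, and the trade-off parameter `beta` are all assumptions introduced here for illustration. It assumes every entry of `p_c` is strictly positive.

```python
import math


def blahut_arimoto_policy(q, p_c, beta, n_iters=200, tol=1e-12):
    """Toy Blahut-Arimoto iteration (hypothetical helper, not from the paper).

    q[c][a] : assumed action value of action a under team composition c
    p_c[c]  : assumed probability of team composition c
    beta    : trade-off weight; larger values permit more mutual information
    """
    n_actions = len(q[0])
    marginal = [1.0 / n_actions] * n_actions  # start from a uniform marginal
    policy = []
    for _ in range(n_iters):
        # Step 1: with the marginal held fixed, pi(a|c) is proportional to
        # m(a) * exp(beta * q[c][a]).
        policy = []
        for row in q:
            logits = [math.log(max(m, 1e-300)) + beta * v
                      for m, v in zip(marginal, row)]
            top = max(logits)  # subtract the max for numerical stability
            weights = [math.exp(x - top) for x in logits]
            total = sum(weights)
            policy.append([w / total for w in weights])
        # Step 2: recompute the marginal m(a) = sum_c p(c) * pi(a|c).
        new_marginal = [sum(p * pi[a] for p, pi in zip(p_c, policy))
                        for a in range(n_actions)]
        gap = max(abs(n - m) for n, m in zip(new_marginal, marginal))
        marginal = new_marginal
        if gap < tol:
            break
    return policy, marginal


def mutual_information(policy, marginal, p_c):
    """I(C; A) in nats between team composition and action."""
    total = 0.0
    for p, row in zip(p_c, policy):
        for pi, m in zip(row, marginal):
            if p > 0.0 and pi > 0.0:
                total += p * pi * math.log(pi / m)
    return total


if __name__ == "__main__":
    q = [[1.0, 0.0], [0.0, 1.0]]  # each composition prefers a different action
    p_c = [0.5, 0.5]
    for beta in (0.0, 1.0, 5.0):
        policy, marginal = blahut_arimoto_policy(q, p_c, beta)
        print(beta, policy, mutual_information(policy, marginal, p_c))
```

With `beta = 0` the policy ignores the team composition entirely and the mutual information is zero; as `beta` grows, each composition's policy concentrates on its own preferred action and the mutual information increases. A full multi-agent, off-policy setting would need function approximation and an estimate of the marginal, which this sketch does not attempt.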

machine learning, reinforcement learning, team composition, (16 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.34)

Self-Organized Group for Cooperative Multi-agent Reinforcement Learning

Neural Information Processing Systems | Feb-18-2026, 23:37:11 GMT

The framework of centralized training with decentralized execution (CTDE) [8, 28] is one of the popular frameworks for solving cooperative multi-agent tasks.

artificial intelligence, machine learning, reinforcement learning, (17 more...)

Country:

Europe > Netherlands > South Holland > Leiden (0.04)
Asia > China > Beijing > Beijing (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.46)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.35)

25b040c97a75021e57100648a20b1e10-Supplemental-Conference.pdf

Neural Information Processing Systems | Feb-7-2026, 23:13:07 GMT

agent, conductor, dynamic team composition, (15 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

0799492e7be38b66d10ead5e8809616d-Paper-Conference.pdf

Neural Information Processing Systems | Feb-7-2026, 13:15:13 GMT

composition, information, team composition, (15 more...)

Country: Asia > China (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.97)

Mutual-Information Regularized Multi-Agent Policy Iteration

Neural Information Processing Systems | Dec-23-2025, 19:28:48 GMT

composition, mutual-information regularized multi-agent policy iteration, team composition, (6 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.97)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.86)

Mutual-Information Regularized Multi-Agent Policy Iteration

Neural Information Processing Systems | Oct-9-2024, 11:55:19 GMT

composition, mutual-information regularized multi-agent policy iteration, team composition, (3 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
